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DETAILED ACTION 
Response to Arguments 

1 . Applicant's arguments filed 1 1/25/2008 have been fully considered but they are 
not persuasive. 

2. In further review of the present application, the new limitation regarding "these 
parameters are calculated in advance or runtime" in independent claims is not fully 
supported by the original disclosure. Although there is indication in the original 
disclosure that the parameters are calculated in advance {paragraph 23 of the published 
application), the limitation "advance of runtime" is not fully disclosed. Therefore, it is 
reasonable to interpret "calculated in advance" as parameters being calculated before 
the current parameters subjected to normalization. In this case, Haverinen et al. (USPN 
7197456) fully teach that normalization mean and standard deviation parameters are 
calculated from the training data before they are used to normalize mean and standard 
parameters of the testing speech (col. 6, lines 20-38; the standard deviation and mean 
of the training speech are computed before they are used to normalize the mean and 
standard deviation of the test signal). 

3. Regarding 9-1 0 and 1 9-20, input speech is inherently corrupted with noise of 
different power levels due to imperfect communication channels and non-stationary 
noise. It is unclear from the claims how the different power levels of noise are factored 
into the calculation of mean and standard deviation parameters. 
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4. Regarding claims 3-7 and 13-17, Carey is only relied upon for the teaching of 
MLP neural network and not what the MLP neural network is used for. It would have 
been obvious to one of ordinary skill in the art at the time of invention to readily 
recognize that by combining the teaching of Carey with Sukkar and Haverinen et al., 
one would obtain the claimed invention. 

5. The previous office action has been vacated in favor of a new non-final office 
action. 

Claim Rejections - 35 (JSC § 1 12 

6. The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

7. Claims 1 and 1 1 are rejected under 35 U.S.C. 112, first paragraph, as failing to 
comply with the written description requirement. The claim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to 
one skilled in the relevant art that the inventor(s), at the time the application was filed, 
had possession of the claimed invention. The limitation "calculated in advanced of 
runtime" is not fully supported by the original disclose. Although there is indication in 
the original disclosure that the parameters are calculated in advance {paragraph 23 of 
the published application), the limitation "advance of runtime" is not fully disclosed. 
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Therefore, it is reasonable to interpret "calculated in advance" as parameters being 
calculated before the current parameters subjected to normalization. 

Claim Rejections - 35 USC §101 

8. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

9. Claims 1 and 3-10 are rejected under 35 U.S.C. 101 because the claimed 
invention is directed to non-statutory subject matter. 

1 0. Claims 1 and 3-1 0 are rejected under 35 U.S.C. 1 01 as not falling within one of 
the four statutory categories of invention. While the claims recite a series of steps or 
acts to be performed, a statutory "process" under 35 U.S.C. 101 must (1) be tied to 
another statutory category (such as a particular apparatus), or (2) transform underlying 
subject matter (such as an article or material) to a different state or thing (Reference the 
May 15, 2008 memorandum issued by Deputy Commissioner for Patent Examining 
Policy, John J. Love, titled "Clarification of 'Processes' under 35 U.S.C. 101"- publicly 
available at USPTO.GOV, "memorandum to examining corps"). The instant claims 
neither transform underlying subject matter nor positively tie to another statutory 
category that accomplishes the claimed method steps, and therefore do not qualify as a 
statutory process. For example, the steps of extracting, inputting, segmenting, 
generating, utilizing, combining, and comparing are not "tied to" any particular apparatus 
or machine. 
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Claim Rejections - 35 USC § 103 

1 1 . The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1 2. Claims 1,8-11, and 1 8-20 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Sukkar (USPN 6292778) in view of Haverinen et al. (USPN 
7197456). 

1 3. Regarding claim 1 and 1 1 , Sukkar discloses a method and system (figure 4) for 
utterance verification comprising the steps of: 

(A) extracting a sequence of feature vectors from an input speech {col. 8, line 60 
to col. 9, line 19 or referring to element 310 in figure 3); 

(B) inputting the sequence of feature vectors to a speech recognizer for obtaining 
at least one candidate string (speech recognizer 312 in figure 3 produces a number of 
recognition candidates and sub-word segmentation; also referring to col. 9, line 59 to 
col. 10, line 19); 

(C) segmenting the input speech into at least one speech segment according to 
the content of candidate string, which comprises individual recognition units, wherein 
each speech segment corresponds to a recognition unit and each recognition unit 
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corresponds to a verification unit {speech recognizer 312 in figure 3 produces a number 
of recognition candidates and sub-word segmentation 312; or also referring to col. 9, 
line 59 to col. 10, line 19); 

(E) utilizing a verification-unit corresponded classifier for each speech segment 
to calculate the verification score, where the sequence of verification feature vectors of 
the speech segment is used as the input of the classifier (col. 10, line 58 to col. 1 1, line 
32 or referring to the operation of figure 4)\ 

(F) combining the verification scores of all speech segments for obtaining an 
utterance verification score of the candidate string (col. 10, line 58 to col. 1 1, line 32 or 
referring to the operation of figure 4); and 

(G) comparing the utterance verification score of the candidate string with a 
predetermined threshold so as to accept the candidate string if the utterance verification 
score is larger than the predetermined threshold (col. 1 1, lines 1-32 or referring to the 
operation of figure 4). 

While the system of Sukkar inherently includes the step of normalizing extracted 
speech features for speech recognizer, Sukkar fails to explicitly disclose (D) generating 
a sequence of verification feature vectors for each speech segment according to the 
sequence of feature vectors of the speech segment, wherein the verification feature 
vectors are generated by normalizing the feature vectors using the normalization 
parameters of the verification unit corresponding to the speech segment, the 
normalization parameters of the verification unit are means and standard deviations of 
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the feature vectors corresponding to the verification unit in training data, and these 
parameters are calculated in advance of runtime. 

However, Haverinen et al. teach (D) generating a sequence of verification feature 
vectors for each speech segment according to the sequence of feature vectors of the 
speech segment, wherein the verification feature vectors are generated by normalizing 
the feature vectors using the normalization parameters of the verification unit 
corresponding to the speech segment, the normalization parameters of the verification 
unit are means and standard deviations of the feature vectors corresponding to the 
verification unit in training data, and these parameters are calculated in advance of 
runtime (col. 6, lines 20-38; the standard deviation and mean of the training speech are 
computed before they are used to normalize the mean and standard deviation of the 
test signal). 

Since Kukkar and Haverinen et al. are analogous art because they are from the 
same field of endeavor, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Sukkar by incorporating the teaching of Haverinen et al. 
in order to improve speech recognition and verification accuracy. 

1 4. Regarding claims 8 and 1 8, Sukkar further discloses the method and system as 
claimed in claims 1 and 11, respectively, wherein in step (F), the utterance verification 
score of the candidate string is the mean of the verification scores of the speech 
segments in the input speech (col. 1 1, lines 5-15, averaging subword scores). 
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1 5. Regarding claims 9 and 1 9, Sukkar further discloses the method and system as 
claimed in claims 1 and 11, respectively, wherein the input speech is corrupted by noise 
with different power levels of SNR (figure 1, speech is received via a telephone network, 
thus, a certain amount of noise must be added to the speech signal when received). 

1 6. Regarding claims 1 0 and 20, Sukkar further disclose the method and system as 
claimed in claims 6 and 16, wherein the speech segments used for training are 
corrupted by noise with different power levels of SNR (speech recognizer 312 in figure 3 
contains HMM models that are trained in advance; the speech used to train the speech 
recognizer 312 in advance inherently includes noise generated by either microphone of 
communication channel). 

1 7. Claims 3-7 and 1 3-1 7 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Sukkar (US 6292778) in view of Haverinen et al. (USPN 7197456), and further in 
view of Carey et al. (US 5526465). 

1 8. Regarding claims 3 and 1 3, Sukkar fails to specifically disclose the method and 
system as claimed in claims 1 and 11, respectively, wherein in step (E), the classifier is 
a neural network, and the neural network is an multi-layer perceptron (MLP). However, 
Carey et al. teach that the classifier is a neural network, and the neural network is a 
MLP (col. 11, lines 1-20). 
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Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to increase discrimination between world models and personal models to improve the 
verification process. 

1 9. Regarding claims 4 and 1 4, Sukkar further discloses that the verification score of 
a speech segment is the mean of the verification scores of the sequence of verification 
feature vectors corresponding to the speech segment {col. 11, lines 5-15, averaging 
subword scores). Sukkar, however, fails to specifically disclose wherein the MLP is 
used to calculate the verification score by inputting the verification feature vector and 
performing the feed-forward processing. However, Carey et al. teach wherein the MLP 
is used to calculate the verification score by inputting the verification feature vector and 
performing the feed-forward processing {col. 9, lines 43-50, forward and backward pass 
calculations). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to improve speech recognition accuracy. 

20. Regarding claims 5 and 1 5, Sukkar fails to specifically disclose the method and 
system as claimed in claims 3 and 13, respectively, wherein the MLP is trained by using 



Application/Control Number: 10/628,361 Page 10 

Art Unit: 2626 

an error back-propagation algorithm to reduce the mean square error between the 
verification score output of the MLP and the target value. However, CAREY ET AL. 
teach wherein the MLP is trained by using an error back-propagation algorithm to 
reduce the mean square error between the verification score output of the MLP and the 
target value {col. 11, lines 1-20 and col. 12, lines 1-20). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al.. in order 
to increase discrimination between world models and personal models to improve the 
verification process. 

21 . Regarding claims 6 and 1 6, Sukkar fails to specifically disclose the method and 
system as claimed in claims 5 and 15, respectively, wherein the MLP corresponding to 
the verification unit is trained by inputting the sequences of verification feature vectors 
of the speech segments corresponding to the verification unit and the sequences of 
verification feature vectors of the speech segments not corresponding to the verification 
unit. However, Carey et al. teach wherein the MLP corresponding to the verification unit 
is trained by inputting the sequences of verification feature vectors of the speech 
segments corresponding to the verification unit and the sequences of verification feature 
vectors of the speech segments not corresponding to the verification unit {col. 10, line 
43 to col. 11, line 34). 
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Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to train the speech recognizer to recognizer the speaker with high accuracy. 

22. Regarding claims 7 and 1 7, Sukkar fails to specifically disclose the method and 
system as claimed in claims 6 and 16, wherein the target value is 1 if the speech 
segment corresponds to the verification unit and which is 0 if the speech segment does 
not correspond to the verification unit. However, Carey et al. teach wherein the target 
value is 1 if the speech segment corresponds to the verification unit and which is 0 if the 
speech segment does not correspond to the verification unit (col. 1 1, line 60 to col. 12, 
line 7, P p =0 and P w =1, that means the speaker is not verified; and P p =1 and P w =0 
means the speaker is verified). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to verify speaker with high accuracy. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to HUYEN X. VO whose telephone number is (571)272- 
7631 . The examiner can normally be reached on M-F, 9-5:30. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on 571-272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 



/Huyen X Vo/ 

Primary Examiner, Art Unit 2626 
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